1 |
C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
The Multilingual TEDx Corpus for Speech Recognition and Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
An Information Retrieval Test Collection for English SMS Conversations
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Microblogging Temporal Summarization: Filtering Important Twitter Updates for Breaking News
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Frontiers, Challenges, and Opportunities for Information Retrieval – Report from SWIRL 2012, The Second Strategic Workshop on Information Retrieval in Lorne
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Formative Evaluation for Multilingual Multimedia Search and Sense-Making
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Advances in Multilingual and Multimodal Information Retrieval : 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers
|
|
|
|
UB Frankfurt Linguistik
|
|
Show details
|
|
11 |
Combining Evidence from Unconstrained Spoken Term Frequency Estimation for Improved Speech Retrieval
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Classifying Attitude by Topic Aspect for English and Chinese Document Collections
|
|
|
|
Abstract:
The goal of this dissertation is to explore the design of tools to help users make sense of subjective information in English and Chinese by comparing attitudes on aspects of a topic in English and Chinese document collections. This involves two coupled challenges: topic aspect focus and attitude characterization. The topic aspect focus is specified by using information retrieval techniques to obtain documents on a topic that are of interest to a user and then allowing the user to designate a few segments of those documents to serve as examples for aspects that she wishes to see characterized. A novel feature of this work is that the examples can be drawn from documents in two languages (English and Chinese). A bilingual aspect classifier which applies monolingual and cross-language classification techniques is used to assemble automatically a large set of document segments on those same aspects. A test collection was designed for aspect classification by annotating consecutive sentences in documents from the Topic Detection and Tracking collections as aspect instances. Experiments show that classification effectiveness can often be increased by using training examples from both languages. Attitude characterization is achieved by classifiers which determine the subjectivity and polarity of document segments. Sentence attitude classification is the focus of the experiments in the dissertation because the best presently available test collection for Chinese attitude classification (the NTCIR-6 Chinese Opinion Analysis Pilot Task) is focused on sentence-level classification. A large Chinese sentiment lexicon was constructed by leveraging existing Chinese and English lexical resources, and an existing character-based approach for estimating the semantic orientation of other Chinese words was extended. A shallow linguistic analysis approach was adopted to classify the subjectivity and polarity of a sentence. Using the large sentiment lexicon with appropriate handling of negation, and leveraging sentence subjectivity density, sentence positivity and negativity, the resulting sentence attitude classifier was more effective than the best previously reported systems.
|
|
Keyword:
bilingual; classification; Computer Science; cross-language; facet; Information Science; Library Science; subtopic; test collection
|
|
URL: http://hdl.handle.net/1903/8150
|
|
BASE
|
|
Hide details
|
|
13 |
Overview of the CLEF-2006 cross-language speech retrieval track
|
|
|
|
In: Oard, Douglas W., Wang, Jianqiang, Jones, Gareth J.F. orcid:0000-0003-2923-8365 , White, Ryen W., Pecina, Pavel, Soergel, Dagobert, Huang, Xiaoli and Shafran, Izhak (2007) Overview of the CLEF-2006 cross-language speech retrieval track. In: CLEF 2006: Workshop on Cross-Language Information Retrieval and Evaluation, 20-22 Sept. 2006, Alicante, Spain. (2007)
|
|
BASE
|
|
Show details
|
|
14 |
Investigating cross-language speech retrieval for a spontaneous conversational speech collection
|
|
|
|
In: Inkpen, Diana, Alzghool, Muath, Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Oard, Douglas W. (2006) Investigating cross-language speech retrieval for a spontaneous conversational speech collection. In: HLT-NAACL 2006 - The Human Language Technology Conference - North American Chapter of the Association for Computational Linguistics Annual Meeting, 8-9 June 2006, New York, USA. (2006)
|
|
BASE
|
|
Show details
|
|
15 |
The Effect of Bilingual Term List Size on Dictionary-Based Cross-Language Information Retrieval
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
16 |
TREC-9 Experiments at Maryland: Interactive CLIR
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
17 |
COMPLEX QUESTION ANSWERING BASED ON A SEMANTIC DOMAIN MODEL OF CLINICAL MEDICINE
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Comparing User-Assisted and Automatic Query Translation
|
|
|
|
In: DTIC AND NTIS (2005)
|
|
BASE
|
|
Show details
|
|
|
|